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Abstract 

^ We investigate characteristics of random split trees introduced by 

^ Devroye |2C; split trees include for example binary search trees, m- 

ly-j ary search trees, quadtrees, median of {2k + 1) -trees, simplex trees, 

CN tries and digital search trees. More precisely: We introduce the use 

^ of renewal theory in the studies of split trees, and use this theory to 

P^ prove several results about split trees. A split tree of cardinality n 

Q.^ is constructed by distributing n "balls" (which often represent "key 

(~| numbers") in a subset of vertices of an infinite tree. One of our main 

^ results is to give a relation between the deterministic number of balls 

^ n and the random number of vertices N. In f^jl there is a central 

I— ' limit law for the depth of the last inserted ball so that most vertices 

^-H are close to ^ + Oi \/ln n j , where /i is some constant depending on 

*^ the type of split tree; we sharpen this result by finding an upper bound 

^^ for the expected number of vertices with depths > ^^ + ln*'-^+'^ n or 

^ depths < ^ + In'''^^'^ n for any choice of e > 0. We also find the first 

^ asymptot^c of the vanances of the depths of the balls ^n the tree. 

o 

^ 1 Introduction 

S<^ 1.1 Preliminaries 

^ In this paper we consider random split trees introduced by Devroye [3j. Some 

important examples of split trees are binary search trees, m-axj search trees, 
quadtrees, median of {2k + l)-trees, simplex trees, tries and digital search 
trees. As shown in |3] the split trees belong to the family of so-called logn 
trees, i.e., trees with height (maximal depth) a.a.s. 0{logn). (For the nota- 
tion a.a.s, see [13].) 

The (random) split trees constitute a large class of random trees which 
are recursively generated. Their formal definition is given in the "split tree 
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generating algorithm" below. To facilitate the penetration of this rather 
complex algorithm we will first provide a brief heuristic description. 

A skeleton tree Sb of branch factor b is an infinite rooted tree in which 
each vertex has exactly b children that are numbered 1,2, ...,6. A split 
tree is a finite subtree of a skeleton tree Sb- The split tree is constructed 
recursively by distributing balls one at a time to a subset of vertices of Sb- 
We say that the tree has cardinality n if n balls are distributed. Since many 
of the common split trees come from algorithms in Computer Science the 
balls often represent some "key numbers" or other data symbols. There is 
also a so-called vertex capacity, s > 0, which means that each node can hold 
at most s balls. We say that a vertex f is a leaf in a split tree if the node 
itself holds at least one ball but no descendants of v hold any balls. The split 
tree consists of the leaves and all the ancestors of the leaves, in particular the 
root of Sb, but no descendant of a leaf is included. In this way the definition 
of leaves in split trees is equivalent to the usual definition of leaves in trees. 
See Figure [l] and Figure [2} where two examples of split trees are illustrated 
(the parameters sq and si in the figures are introduced in the formal "split 
tree generating algorithm"). 

The first ball is placed in the root of Sb- A new ball is added to the tree 
by starting at the root, and then letting the ball fall down to lower levels in 
the tree until it reaches a leaf. Each vertex v of 5*^ is given an independent 
copy of the so-called random split vector V = (Vi, V2 . . . , H) of probabilities, 
where Yli Vi = ^ and l^ > 0. The split vectors control the path that the ball 
takes until it finally reaches a leaf; when the ball falls down one level from 
vertex v to one of its children, it chooses the i-th child of v with probability 
Vi, i.e., the i-th component of the split vector associated to v- When a full 
leaf (i.e., a leaf which already holds s balls) is reached by a new ball it splits. 
This means that some of the s + 1 balls are given to its children, leading to 
new leaves so that more nodes will be included in the tree. When all the 
n balls are distributed we end up with a split tree with a finite number of 
nodes which we denote by the parameter N. 

The split tree generating algorithm: The formal, comprehensive 
"split tree generating algorithm" is as follows with the following introductory 
notation. The (random) split tree has the parameters b, n, s and V as we 
described above; there are also two other parameters: Sq, Si (related to the 
parameter s) that occur in the algorithm below. Let n„ denote the total 
number of balls that the vertices in the subtree rooted at vertex v hold 
together, and C^ be the number of balls that are held by v itself. Thus, we 
note that a vertex f is a leaf if and only if C^, = n^ > 0. Also note that a 
vertex v E Sb is included in the split tree if, and only if, n^ > 0. If n^, = 0, 
the vertex v is not included and it is called useless. 



Below there is a description of the algorithm which determines how the 
n balls are distributed over the vertices. Initially there are no balls, i.e., 
Cj, = for each vertex v. Choose an independent copy V^ of V for every 
vertex v G Sb- Add balls one by one to the root by the following recursive 
procedure for adding a ball to the subtree rooted at v. 

1. If f is not a leaf, choose child i with probability V^, and recursively add 
the ball to the subtree rooted at child i, by the rules given in steps [T| 
|2]and[3l 

2. If f is a leaf and C^, = n^ < s, (s is the capacity of the vertex) then 
add the ball to v and stop. Thus, C„ and n^ increase by 1. 

3. If t" is a leaf and 0^ = 71^ = s, the ball cannot be placed at v since it 
is occupied by the maximal number of balls it can hold. In this case 
let n^, = s + 1 and Cy = Sq, by placing Sq < s randomly chosen balls 
at V and s + 1 — Sq balls at its children. This is done by first giving 
Si randomly chosen balls to each of the b children. The remaining 
s + 1 — So — bsi balls are placed by choosing a child for each ball 
independently according to the probability vector Vv = (Vi, V2, . . . , H), 
and then using the algorithm described in steps [T| [2] and [3] applied to 
the subtree rooted at the selected child. Note that if Sq > or si > 0, 
this procedure does not need to be repeated since no child could reach 
the capacity s, whereas in the case sq = this procedure may have to 
be repeated several times. 

From|3j it follows that the integers Sq and Si have to satisfy the inequality 

< So < s, < 6si < s + 1 — So. 

Note that every nonleaf vertex has Cy = Sq balls and every leaf has < C^ < 
s balls. 

Figure [1] shows a split tree with cardinality 32 and parameters (6, s, Sq, Si) = 
(4, 3, 1, 0) and Figure [2] shows a split tree with cardinality 21 and parameters 
(6,s,so,si) = (2,4,0,2). 

We can assume that the components Vi of the split vector V are identically 
distributed. If this were not the case they can anyway be made identically 
distributed by using a random permutation, see |3]. Let \^ be a random 
variable with this distribution. This gives (because Yli ^ = 1) that E(V) = 
^. We use the notation T" to denote a split tree with n balls. However, note 
that even conditioned on the fact that the split tree has n balls, the number of 
nodes A^, is still a random number. The only parameters that are important 



b=4 s_0=l 

s=3 s 1=0 



All internal vertices have s_0=l balls 




All leaves have between 1 and s=3 balls. 
Note that s_l=0. 



Figure 1: This figure illustrates a split tree with parameters 6 = 4, s = 3, sq = 1 
and si = 0. 



All internal vertices have s 0=0 balls 




All leaves have between 2 and s=4 balls. 
Note that s 1 is at most 2. 



Figure 2: This figure illustrates a split tree with parameters b 
and si = 2. 



4, so = 



in this work (and in general these parameters are the important ones for most 
results concerning split trees) are the cardinality n, the branch factor b and 
the split vector V; this is illustrated in Section L4 As an example, in the 
binary search tree considered as a split tree, 6 = 2 and the split vector V is 
{U, 1 — U) where U is a. uniform f/(0, 1) random variable. This is a beta (1, 1) 
random variable. In fact for many important split trees V is beta-distributed. 
(The other parameters for the binary search tree considered as a split tree 
are s = 1, Sq = 1 and Si = 0.) For the binary search tree the number of balls 
n is the same as the number of vertices N; this is not true for split trees in 
general. 



1.2 Notation 

In this section some of the notation that we use in the present study is 
collected. 

Let T^ denote a split tree with n balls; for simplicity we often write T. 
Let V(T) denote the set of vertices in a rooted tree T. We write 15*1 for the 
number of vertices in a set 5*. Note that for the number of vertices A^ we have 
N = |V(T"')|. Let Ty be a subtree rooted at v. Let n„ denote the number 
of balls in the subtree rooted at vertex v and let A''^ denote the number of 
vertices. Note that N^ = |V(T^)|. 

Let Dn denote the depth of the last inserted ball in the tree and Dk the 
depth of the fc-th inserted ball when all n balls have been added. Let D* 

/ h—^ ^k f 

be the average depth, i.e., D* = — ^=^ . We also use the notation Dj, for 

the depth of the node of ball k when it is added to the tree; this could differ 
from Dfc since the ball can move during the splitting process: DJ, < D^. 
Equivalently, D^ is the depth of the last ball in a split tree with k balls. Let 
d(v) denote the depth (or height) of a vertex v, sometimes we just write d 
for the depth of v. 

Let p{v) denote the parent of a vertex v. 

There are at least two different types of total path lengths in a tree T that 
are of interest: the sum of all depths (distances to the root) of the balls in 
T, and the sum of all the depths of the vertices in T. We denote the former 
by ^(T) and the latter by T(T). 

We use the standards notations, N{fi, a"^) for a normal distribution with 
expected value /x and variance a^, and Bin(m,p) for a random variable X 
with a binomial distribution with parameters m and p. We also use the 
notation mixed binomial distribution (X, Y) or for short mBin(X, Y) for a 
binomial distribution where at least one of the parameters X and y is a 
random variable (the other one could be deterministic). Let if„ denote the 
height of a split tree with n balls. 

Let T^., i E {I, . . . ,b^} he the subtrees rooted at depth L = [f3 log^ In n\ 
for some constant /3. For simplicity we just write T,, i G {1,...,6^} for 
these. Let d^^v) := d{v) — d{w), i.e., the depth of a vertex v in the subtree 
T^. In particular we write di{v) := d{v) — L for the depth of a vertex v in 
the subtrees Tj, i E {I, . . . , b^}. 

Recall that V^ is a random variable with the distribution of the iden- 
tically distributed components Vi, i G {!,..., 6} in the split vector V = 
(Vi, . . . ,Vb). Let A = ^5 be the size biased distribution of (Vi, . . . ,Vb), i.e., 
given (Vi, . . . , H), let A = Vj with probability Vj, see [3]. Let, 

c:=E{A) = bE{V^), 



and 

/i:=E('-lnA') =bE(-V\nV^ 
a^ ■=\air(\nA) =bE(v\n^v) - f?. (1) 

Note that the second equahties of /i and a imply that they are bounded. 
Similarly all moments of — In A are bounded. 

For a given e > 0, we say that a vertex v in T" is "good" if 

/i"^ Inn- ln°-^+' n < d{v) < fi-^ lnn + ln°-^+' n, (2) 

and "bad" otherwise. We write V*(T") for the set of good vertices in T"', 
and for the number of good vertices we write N* := |V*(T"') |. 

We say that K^ = Op{am) if am is a positive number and Y^ is a random 
variable such that Ym/dm — ?■ as m — )■ oo. We use two unusual types of 
order notation; let a^ be a positive number and Y^ a random variable, by the 
notation Y^ '■= Oipiam) we mean that (E{Ym^))p < Cam for some constant 
C, and by the notation Y^ '■= OLp{am) we mean that (E(Fm^))p/am -^ 0. 
We use the notation VLj for the a-field generated by {n^,, d{y) < ]}■ Finally 
we write ^j for the cr-field generated by the V- vectors for all vertices v with 
d{y)<3. 

1.3 A weak law and a central limit law for the depth 

In [3] Devroye presents a weak law of large numbers and a central limit law for 
Dn (the depth of the last inserted ball). If P(y = 1) = and P(\/ = 0) < 1 
then 



Inn 
and 



Dn P _i /„N 

-^ /x , (3) 



^{Dnj , , _i 



In 77, 



-> f^-'- (4) 



From the following lemma it follows easily (as we explain below) that Q 
also holds for the average depth D* . Recall that D^ is the depth of the k-th 
ball in the tree when all n balls are added. 

Lemma 1.1. For i < j , we have Di < Dj in stochastic sense. 



Proof. We show this by showing that for an arbitrary i G {l,...,n — 1}, 
Di < -Di+i, where the inequahties and equahties below are in the stochastic 
sense only. We show this by the use of coupling arguments. 

First consider two identical copies T and T of the split tree when i — 1 
balls have been added, where we let w in T denote the corresponding vertex 
of f in T. More precisely, we consider two split trees T and T with the 
same split vectors in all vertices of the infinite skeleton tree, and if a ball k, 
k < i — 1, is added to f in T then ball k is added to v in T. We now assume 
that we add the two balls i and i + 1 to T and T. 

If ball i and ball i + 1 are added to different leaves /i and I2 in T then 
in T we let them switch positions, i.e., ball i is added to I2 and ball i + 1 is 



added to /i. (Recall the notation Dl from Section 1.2 ) Hence, it is obvious 



for reasons of symmetry that Df = Dij^-^. When the balls G {z + 2, . . . , n} are 
added, we add them to the corresponding vertices in T and T. Thus, the two 
trees are identical in the whole process except for that ball i and ball i + 1 
always have switched positions in T and T. Hence, by symmetry Di = -Dj+i. 

If ball i and ball i + 1 are added to the same leaf / in T then there are 
three different cases: 

li rii < s — 2, so that / does not split when also ball i and ball i + 1 have 
been added, then T and T are still identical since ball i and ball i + 1 stay 
in /. When more balls are added we can again assume that ball i and ball 
z + 1 have switched positions in T and T at every step of the the recursive 
construction until all n balls are added. Hence, by symmetry Di = -Dj+i. 

If ri; = s — 1, so that / gets s + 1 balls when the new balls are added, I 
splits according to the usual splitting process when ball i + 1 is added. Again 
we let ball i and ball i + 1 switch positions in T and T. This means that if 
ball i is added to vi and ball z + 1 is added to V2 in T, then in T ball i is added 
to V2 and ball i + 1 is added to vi. Thus, again by symmetry D] = Di^^. By 

using the same type of argument as in the cases above we get Di = -Dj+i. 

If rii = s, so that / in T gets s + 2 balls when the new balls are added, we 
let / split according to the usual splitting process where / keeps Sq balls and 
sends the other balls to its children. 

If ball i is one of the sq balls in the children then it is obvious without 
using the coupling that Df < Di_^_^ and also Di < Z^j+i. 

If ball i is not one of the Si balls in the children of Z in T and ball i is 
added to f 1 and ball z + 1 is added to V2, then in T we can again assume that 
ball i is added to V2 and ball z + 1 is added to Vi. Thus, in the stochastic 
sense DJ = -D/^i, and Di = -Dj+i. 

If ball i is one of the Si balls in T, we use a related but not an identical 



type of coupling argument as in the previous cases. In this case ball i is added 
by uniformly choosing one of the b children of / each with probability |, while 
ball i + 1 is added by using the probabilities given by the components in the 
split vector V/ of /. Again T and T are identical until i — 1 balls are added 
baring the possibility of variation in the split vectors of the vertices above 
the leaves as described below. If ball i in T goes to a child vi of / related 
to a component Vj in V;, then we add ball i + 1 in T to t?i with probability 
min{l, ^k} and to one of the other children related to a component Vk > 1/b 

with probability max{0, 1 — j-^}, so that the sum of the probabilities gives 
the right marginal distribution. Assume that ball i is added to the child 
u of Z in T and ball i + 1 is added to the child w of / in T. This means 
that w relates to a component of the split vector of I at least as large as the 
component of the split vector of / related to v. Now we can assume that 
the split vectors in the vertices in the subtree rooted at v correspond to the 
split vectors in the vertices in the subtree rooted at w. This means that we 
can assume that when ball number j in the subtrees is added it goes to the 
corresponding vertex in both of the subtrees. However, note that the balls 
could have different labels if we consider their original label in the whole 
tree, since the subtree rooted in w could have more balls than the subtree 
rooted in v. Thus, as long as the subtrees have the same number of balls, 
new balls are added to the corresponding positions in these subtrees, and 
ball i and ball i + 1 are also held by vertices of corresponding positions. This 
construction shows that if the subtrees rooted in w and v have k and / balls, 
respectively, where k > I, and ball i in Tj, is in vertex h, then ball z + 1 in 
T is in a subtree of T^s with root corresponding to the position of h. This 
shows that in the stochastic sense Di < -Dj+i. 

Hence, in all cases, D^ < -Dj+i stochastically and thus for i < j, it follows 
that Di < Dj in stochastic sense. D 

This means in particular that for all k < n, E{Dk) < E(D„). Since 
the sum of — L-^ for k < r-i— is o(l), we can ignore the balls k < r^- 
We consider the balls k > 7-3—- For ball k > -^ it follows from vM that 

— In n — m n ^ f 

E(D{) ~ ^-^\nn. Thus, since for all k > ^, E(L)f) < E{Dk) < E(D„), 
we get 

-j > ^ ■ 5 

inn 
Furthermore, see [Sj Theorem 1], if a > 0, and assuming that V is not 



monoatomic, i.e., we don't have V = |, 



''"7"''°" 4iV(0.1). (6) 



n 



d 



where iV(0, 1) denotes the standard Normal distribution and — )■ denotes con- 
vergence in distribution. Tries are special forms of split trees with a random 
permutation of deterministic components {pi,P2, ■ ■ ■ ,Pb) and therefore not 
as random as many other examples. (In the literature tries have also been 
treated separately to other random trees of logarithmic height.) Of all the 
most common examples of split trees only some special cases of tries (the 
symmetric tries and symmetric digital search trees) have a monoatomic distri- 
bution of V. From (p| it follows that "most" nodes lie at /i~^ lnn + 0{y/hi'i 



.n] 



1.4 Subtrees 

For the split tree where the number of balls n > s, there are sq balls in 
the root vertex and the cardinalities of the b subtrees are distributed as 
(si, . . . , Si) plus a multinomial vector {n — So — bsi,Vi, . . . ,Vb). Thus, condi- 
tioning on the random V -vector that belongs to the root, the subtrees rooted 
at the children have cardinalities close to nVi, . . . ,nVb. This is often used in 
applications of random binary search trees. In particular, we used this fact 
frequently in |Tl]. "The split tree generating algorithm" described above, 
and the fact that a mBin(X, pi) in which X is Bm{m,p2) is distributed as 
a Bm{m,piP2), give in a stochastic sense, an upper bound on the number 
of balls n„ in a subtree rooted at a vertex v. Let f be a vertex at depth d, 
conditioning on ^^ (i.e., the cr-field generated by the V vectors for all vertices 
V with d{v) < d), gives 

d d 

n^ < Bin(n, Yl Wj,^) + Bin(si, JJ Wj,^) 

d 

+ Bin(si, Yl Wj,v) + ■■■ + Bin(si, Wd,v) + si, (7) 

i=3 

where Wj^^^j G {1, . . . rf} are i.i.d. random variables given by the split vectors 
associated with the nodes in the unique path from v to the root. This means 
in particular that Wj^^ = V. However, we note that the terms in (7) are not 
independent. Also observe that ^^ is equivalently the a-field generated by 
Wj^vij € {^) ■ ■ ■ id} for all V with d{v) = d. Similarly, we also have a lower 



bound for n„, i.e., for v at depth d, conditioning on '^a in stochastic sense, 

d d 



n, > Bin(n, JJ W,,,) - Bin(s, JJ W,- ,) 
i=i j=2 

d 

- Bin(s, II W,,,) Bin(s, Wa,,); (8) 

i=3 

we can replace the term s by Sq + bsi < s for a sharper bound. As in ([T]) the 
terms in rtsl) are not independent. 

Recall that for a Bin(m,p) distribution, the expected value is mp and the 
variance is mp{l — p). Thus, Chebyshev's inequality applied to the domi- 
nating term Bin(n, Y[j=i ^j,v) in (7) gives that n^ for v at depth d is close 
to 

M: := nWi,,W2,o . . . Wd,.. (9) 

More precisely by using ([7| and (IS]), the Chebyshev and Markov inequalities 
give for V with d{v) = d, that for large n, 

A n«^ E(Var(Bin(n,n,til^,,.)%)) 



i=i 



^1.2 



E(E(Bin(s, n -=2 Wj,,) + Bin(s, [its WG>) + ■ ■ ■ + s 
+4^ ^ 



^d 



^0.6 

Since the Uy^s (conditioned on the split vectors) for all v at the same depth 
are identically distributed, we sometimes skip the vertex index of Wj^y and 
just write Wj. 

1.5 Renewal Theory 

Renewal theory is a widely used branch of probability theory that generalizes 
Poisson processes to arbitrary holding times. A classic in this field is Feller [5] 
on recurrent events. First we recollect some standard notation. Let Xq = 
a.s.. Let Xk, fc > 1, be i.i.d. nonnegative random variables distributed as 
X and let Sm, m > 1, be the partial sums. Let F denote the distribution 
function of X, and let Fm be the distribution function of 5*^, m > 0. Thus, 
for a; > 0, 

Fo(x) = l, Fi(x) = F(x), F^{x)=F^\x), 

10 



i.e., Fm equals the m-fold convolution of F itself. The "renewal counting 
process" {N{t), t > 0} is defined by 

Nit) := max{m : S'm < t}, 

which one can think of as the number of renewals before time t of an object 
with a lifetime distributed as the random variable X. In the specific case 
when X = Exp(A), {A/'(t), t > 0} is a "Poisson process". An important well 
studied function is the so called "standard renewal function" defined as 

oo 

V{t):=Y,FUt), (11) 

m=0 

which one can easily show is equal to E(A/'(t)). The renewal function V{t) 
satisfies the so called renewal equation 

V{t) = l + {V*dF){t), t>0. 

For a broader introduction to renewal theory, see e.g. [1], |6], [7] and [9]. 
One of the main purposes of this study is to introduce renewal theory in the 
context of spht trees. Recall from ^ in Section 1.4 that the subtree size n^ 



for V at depth k, is close to nWiW2 ■ ■ ■ Wk, where Wj, j G {1, . . . , k}, are in- 
dependent random variables distributed as V. Now let Yk := —'^j=i^^Wj, 
and for simplicity we also denote the summands Wr '■= —InWj. Note that 
nWiW2 ■ ■ ■ Wk = ne~^*. Recall that in a binary search tree, the split vec- 
tor V = (Vi,V2) is distributed as (f/, 1 — U) where U \s a. uniform f/(0, 1) 
random variable. For this specific case of a split tree the sum Yk-, (where 
Wj, j G {1, . . . , /c}, in this case are i.i.d. uniform t/(0, 1) random variables) 
is distributed as a r(A;, 1) random variable. This fact is used by, e.g., De- 
vroye in |1] to determine the height of a binary search tree. For general split 
trees there is no simple common distribution function of 'Yl,j=i InVFj, instead 
renewal theory can be used. 
Let 

Vk{t) := h''V{Yk < t). 

We define the renewal function 

oo 

U{t):=J2Mt)- (12) 

fc=i 

We also denote z/(t) := z/i(t) = bPlw^ < t). For U{t) we obtain the following 
renewal equation 

oo 

U{t) = v{t) + Y^{vk * dv){t) = v{t) + {U * dv){t). (13) 



fc=i 
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2 Main Results 

In this section we present the main theorems of this work. 



(Al). In this work we assume as in Section 1.3 that P(V = 1) = 0, and we 



also assume for simplicity that P{V = 0) = and that —\nV is non-lattice. 

The reason for the non-lattice assumption (Al) is that we use renewal 
theory and there it often becomes necessary to distinguish between lat- 
tice and non-lattice distributions. Note that the assumption that V is not 



monoatomic in Section 1.3 is included in the assumption that — Inl^ is non- 
lattice. Again of the common split trees only for some special cases of tries 
and digital search trees does — In V^ have a lattice distribution. Our first 
main result is on the relation between the number of vertices N (recall that 
this is a random variable) and the number of balls n. 

Theorem 2.1. There is a constant a depending on the type of split tree such 
that 

E{N) = an + o{n), (14) 

and 

Var(A) = o(n2). (15) 

Recall that there is a central limit law for the depth Dn in dol) so that 
most vertices are close to — + O i y/lnn j , our next result sharpens this fact. 
Recall that for any constant e > 0, we say that a vertex v in T" is "good" if 

/i-i Inn- ln°-^+^ n < d{v) < /x"^ In n + ln°-^+^ n, 

and "bad" otherwise. 

Theorem 2.2. For any choice of e > 0, the number of bad nodes in T"' is 
bounded by C^i f j^^ j for any constant k. 

In the third main result we sharpen the limit laws in Q and dsl) for the 
expected value of the depth of the last ball Dn and the average depth D*. 
We also find the first asymptotic of the variances of the /c:th ball Dk for all 

k, T^ <k<n. 

'Inn — — 

Theorem 2.3. For the expected value of the depth of the last ball we have 

vlnn 
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and the same result holds for the average depth D^, i.e., 



VlK 



n 



0. (17) 



Furthermore, for the variance of the depth of the k:th hall we have that for 
all r^ < k<n, 

^^^ ^ aV-^ (18) 

mn 



We complete this section by stating two corollaries of Theorem 2.3 Recall 
that we write V* (T") for the set of good vertices in T", i.e., those with depths 
that belong to the strip in ^. 

Corollary 2.1. Summing over all vertices give 

e( y^ {d{v) - fi~^ In n^] = an fi'^a'^ Inn + o{n Inn). (19) 

For the good vertices we also have 

e( ^ {d{v) - fi~^ In nYj = an fi~^a'^ Inn + o{n Inn). (20) 

We write V* (Tj) for the set of good vertices in Tj. 

Corollary 2.2. Let L = [/31og^lnnJ for some large constant /3. Then, 
summing over all vertices give 

E-^ {di{v) -/i'Mnnj)^ _ a^an ^ ( ^ \ /^.n 

^ 11 ■^ In ni m n \\n n/ 

'-^ ^ev(T,) 

and for the good vertices we also have 

y y (^-("^-^'^""-^' = g^+o,f^). (22) 

^ ^ , /i-sin^n, In^n ^\\n^n) ^ ' 
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3 Some Fundamental Renewal Theory Results 

The main goal of this section is to present a renewal theory lemma and a 
corollary of this lemma, which are both frequently used in this study. In 



contrast to standard renewal theory the distribution function i/(t) in (13) is 



not a probability measure. However, to solve (13) we can apply [1, Theorem 
VI.5.1] which deals with non probability measures. The result we get is 
presented in the following lemma. 



Lemma 3.1. The renewal function U{t) in (12) satisfies 



U{t) = {fi-^ + o{l))e^ ast^ oo. (23) 

Proof. Since the distribution function z/(t) is not a probability measure, we 
define another ("conjugate" or "tilted") measure a; on [0, oo) by 

duj{t) = e-*dv{t). 



Recall from Section |1.2| that A = V^ is the size biased distribution of 
(Vi, . . . , Vfe). We note that uj{x) is the distribution function of the random 
variable — In A since 

p(-lnA<,x) =E(E(/{-ln\/s<a;}|(l^i,...,H))) = 
b 
Y.(y^I{-\YiVi<x}V^ =6E('/{-lnV<x}e-''^(^)) = uj{x). 
1=1 

Thus, a; is a probability measure. Further, by recalling /x := E(— InA) and 
a^ := Var(— InA) gives 

E(a;) = /i, and Var(a;) = crl (24) 

Define U{t) := e'*U{t) and v[t) := e^V(t). We shall apply [1, Theorem 
VI.5.1], but first we need to show that the condition that v{t) is "directly 
Riemann integrable" (d.R.i.) is satisfied. Note that v{t) < 6e~*, and thus 
since z/(t) is also continuous almost everywhere, by [H Proposition IV.4.1.(iv)] 
it follows that i^(t) is d.R.i. if 6e~* is d.R.i.. That 6e~* is d.R.i. follows 
by applying [TJ Proposition IV.4.1.(v)], since 6e~* is a nonincreasing and 
Lebesgue integrable function. Then by applying ^ Theorem VI.5.1] and 



( p4| ) we get 

U{t) = i){t) + {U*du){t), (25) 
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where u{t) is a probability measure, and 



U{t) -^ ji ^ I h'{x)dx = /i "^ / v{x)e ^dx =: n. 
Jo Jo 

Integration by parts now gives 

/ oo r°° 

= /i"^6E(e-^') =/i~\ 
Thus, U{t) = (/i-i + o(l))e*. 



(26) 



(27) 

n 



The following result is a very useful corollary of 



/emma 



3.11 We w rite 
that 



1.4 



for V at depth d{v), M" := rij^-i'^^ Wj. Recall from (9) in Section 
this is close to the real subtree size Uy. 

Corollary 3.1. By taking the sum over vertices v, d{v) = k and letting 
-^ oo, we get that the expected number of nodes with M" > K is equal to 



K 



E( 



oo 



k=0 



--: [/(Inn - IniT) + 1 = (/i"' + o(l)) 



n 
K' 



f28) 



Proof. By using Lemma [3. 1| we get 

d 



d=0 j=l d=0 



{f^-' + o{l)) 



n 
K' 



(29) 

n 



We complete this section with a more general result in renewal theory, 
and a corollary of a more specific result that is valid for the renewal function 



U{t) in (12) 



Theorem 3.1. Let F be a non-lattice probability measure and suppose that 
< /i = E(X) = /q°° xdF{x) < oo and E(X2) = a^ + /^^ < oo. 
Let 



Z{t)=z{t)+ I Z{t-u)dF{u), t>0, 
Jo 



(30) 
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where z{t) is a nonnegative function, such that a := J^ z{u)du < oo. Define 

G{x) = r (Z{t) - -\dt. (31) 

Then 



lim G{x) = / uz{u)du + a — — — — . (32) 

Proof. Let V{t) be the standard renewal function in (11), where Fm{t) = 
P(Er=o^fc <t)- By applying P Theorem IV.2.4], 



Z{t) = z{t- u)dV{u) 



z{u)dV{t - u) 



(33) 



where the last equahty follows because V{t) = for t < 0. By applying (33) 
and Fubini's Theorem we get 



ax 



G{x) = / z{u) / dV{t - u)du = / z{u)V{x - u)du - — . (34) 

Jo io f^ Jo f^ 



Hence, 

G{x) 



z(u) ( V(x — u) ]du 

^ /^^ 

— / z{u)udu / z{u)xdu + I z{u)\V{x — u) ]du. 

f^Jo f^Jx Jo ^ /^ ^ 

(35) 



From [U Proposition VI. 4.1] we have V(t) — - — )■ ^^^2 and by p", Propo- 



1 <r o-^+M^ 



Hence, the Lebesgue dominated con- 



sition VI.4.2], < V{t) 

vergence theorem applied to the last integral in ( 35 ) gives 



lim 



z{u) ( V{x — u) 



X — u 



/i 



/■oo 

I{u < x}du = / z{u) 
Jo 



a^ + fi^ 



2/^2 



du. (36) 



Note that for all x, J^ z{u){u — x)du > 0. Thus, if j^ z{u)udu is integrable 
lima,_^oo /^°° z{u)xdu = 0, and the convergence result in (32) obviously follows. 
If J^ z{u)udu is not integrable then we have a special case of (32), i.e., 
lima;^oo G{x) = -oo. D 
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We define the function 

W{x) 



e-\U{t) - iJ-^e')dt. 



(37) 



Tlie next result is a corollary of Theorem |3.1 



Corollary 3.2. The function W{x) in (31) satisfies 
W{x) 



a'-fi^ 



2/i^ 



fi + o(l) as X ^ oo. 



(38) 



Proof. We apply Theorem 3.1 to Z{t) = U{t) = e *U{t) defined in the proof 

11"^ ' I I 

(recall that U{t) satisfies the renewal equation in (25)). Now, 



the constant a as defined in Theorem 



3.1 



(26) and (27) we get a = 1. Using (24) and (26)-(27) gives, 



satisfies a = j^ u{u)du, thus from 



h'{u)udu = e '^u{u)udu = e ^i'{u)du + ue ^dv{u) = 1 + /i. 

Jo Jo Jo 

n 



4 Proofs of the Main Results 



4.1 Proof of Theorem 2.1 



4.1.1 Lemmas of Theorem [2TT1 

We present below some crucial lemmas by which we can then prove Theorem 



2.1 The proofs of these lemmas are given in Section 4.1.4 below. The first 
lemma is fundamental for the proof. 

Lemma 4.1. For the first moment of the number of vertices N we have 

E{N) = 0{n) (39) 

and for the second moment of N we have 

E{N^) = o(n^). (40) 



Lemma 4.2. Adding K halls to a tree will only affect the expected number 
of nodes in a split tree by 0{K) nodes. 
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Let R be the set of vertices such that conditioned on the spht vectors, 
r eR,ii M; := n H JJ Wj < B and M^^^^ := n Uf=i~^ ^j > B, recall that 
p{r) is the parent of r. For now we just let B be large; however, later our 



choice of B will be more precise. To show ( 14 ) we consider all subtrees rooted 



at some vertex r G R. We denote these subtrees by T^^^, r G R. Recall from 
(Iq]) that with "large" probability the cardinality n^ is "close" to M". We will 
show that in fact we can replace rij. by M" in our calculations. Let n^ be 
the number of balls and let A^^ be the number of nodes in the Tr^B subtree. 



Corollary 3.1 implies that most vertices are in the Tr^B subtrees, i.e.. 



E{N) = E(^J2^r)+o(^ 



reR 



(41) 



The next lemma shows that the expected number of vertices in the T^^b 
subtrees with subtree sizes n^ that differ significantly from M" is bounded 
by a "small" error term for large B. Since the variance of a Bin{m,p) dis- 
tribution is mip — p^), the Chebyshev and Markov inequalities give similarly 



as in (10) that for large S, 



'( 



P In. -M"| > B 



0.6 



<4 



^1.2 ^ ^0.6 



k 1 

- < 



50.1 



(42) 



From (41) we have 



E(A^) = E('^iV:,/{|n, - M;| > B^-^} 

reR 

E(5^iV,/{K-M;|<i?°-'^})+0(|). (43) 



reR 



Lemma 4.3. The expected value of the number of nodes that are not in the 
Tr,B, r G R, subtrees with subtree size Ur that differs from M" with at least 
5°-6 balls, IS 



EiJ2NrI{\nr - M:\ > B'-'}) = 0(-^ 



(44) 



reR 



hence, from (43) 

E{N) = E(^J2^rI{\nr-M-\<B'-'})+0[^). (45) 



reR 



18 



We also sub-divide the Tr^B, r & R, subtrees into smaller classes, wherein 
the M"'s in each class are close to each-other. Choose 7 := e^ and let 
Z := {B, B — '-fB, B — 2'~fB, . . . , eB}, where e = | for some positive integer 
k. We write R^ C R^ z E Z, for the set of vertices r E R, such that 
M" E [z — 'jB, z) and M"/ n > B. (Note that the intervals are of length 
75 and that the set Z contains at most - elements.) We write \Rz\ for the 
number of nodes in R^. The next lemma is a result that we get by the use 



of renewal theory applied to the renewal function U{t) in (12). 



Lemma 4.4. Let 5 := {1, 1 — 7, 1 — 27, . . . ,e}, where 7 = e^. Choose a E S 

and let ^ — )■ 00, then 



B 



E{\R^b\) 



n_ 
B 



Ca + 0(1), 



(46) 



for a constant Ca (only depending on a), and also Xlaes^a = C^(l); where 
the constant in O is not depending on e. 

Before proving these lemmas we show how their use leads to the proof of 
Theorem 12.11 



4.1.2 Proof of (14) in Theorem 2.1 



Proof. For showing ( 14 ) it is enough to show that for two arbitrary values of 



the cardinality n and n, where n > n, we have 

E(A^) E(iV) 



n 



n 



0[e) as n -)■ 00, Ve > 0. 



(47) 



Since (47) implies that — ^ is Cauchy it follows that — ^ converges to some 



constant a as n tends to infinity; hence, we deduce (14). 

We will now prove ( [47| ). 

Recall from Section ]4.1.1| that we will consider the subtrees Tj,,b, r E R, 
rooted at r; these are defined such that M" := nfl i^j Wj < B and M^,^ := 

nUf:l-'W,>B. 

Let -R' C i? be the set of vertices such that r G -R' if 

\nr-Ml!\<B'^-^. (48) 



Lemma 4.3 shows that we only need to consider the vertices in r G R'. 
Let R" C R' be the set of vertices such that r G i?" if r G R' and 

eB < AC < B. 



(49) 
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We will now explain that it is enough to consider the vertices r E B!' . 

Corollary 3.1 for K = B gives that the expected number of parents p(r) 

> 5 is CI 



such that M 



p{r) 



thus, since they only have h children each 
also the expectation of \R\ is Oi 



Lemma 4.1, we get that the expected number of nodes in the T^^b, r G 



Hence, for r G i?' by using (|39|) in 



with M;' < tB is bounded by 0[en). 
From (45)in Lemma |4. 3 1 we get 



E(Ar) = e( ^ iV,) + 0{m) + o( 



n 



rdR" 



B 



0.1 



(50) 



Recall that we sub-divide the T^^s, r E R, subtrees into smaller classes, 
wherein the M^'s in each class are close to each-other, by introducing the 
subsets Rz ^ R, z & Z, where Z = {B, B — jB, B — 2'yB, . . . , eB}. Hence, 
(50) gives 



E{N) = e(^J2 Y1 Nr^+0{enj+0(^ 



n 



z&Z reR'nRz 



BO. 



(51) 



We will now apply Lemma 4.2 to calculate the expected value in (51). 



Let rz be an arbitrarily chosen node in R' fl Rz, where z ^ Z. By using (48) 



and Lemma |42} for any node r^ G R' DRz, we get that the expected number 
of nodes in a tree with the number of balls in an interval [z — '-/B, z) is equal 

to E(A^rJ + o(-fBj. By using (51) this implies that 



E{N) = ^E(|i?' n Rz\) (E{NrJ + 0(75)) + 0{en) + o(^ 



zez 



n 
B^ 



(52) 



Define a^ as the quotient of the expected number of vertices in a tree with 
cardinality [xj divided by [x\. Note from Lemma 4.1 that a^ = C^(l)- 



Recall from Lemma 4.4 that 5" = {1, 1 — 7, 1 — 27, . . . , e}. By using d46 



in Lemma 4.4 and applying (42) we have that for each choice of 7 and a E S, 



there is a a^ such that for a constant c^ (depending on a). 



Ei\R'nR^B\) 



< 



r 



+ °(b^ 



(53) 



whenever ^ > 



. We now choose B = Inn, where n is the smallest of the 
two arbitrary values we start with (i.e., n > n). Thus, we have by the choice 
of B (for n large enough) that ^ > ^ so that (53) holds. 
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O(By) 



Note that since Xlaes'^a = 0{1), we have that Xlaes'^a i? — ^il)- 
Recall that 7 := e^. Thus, for a constant c^ (depending on a) and a^s = 
0(1), we get from (52) and (53) that 



E(iV) =nY,Ca^ (aBa^B + 0(3^)^ + n J^ 0(a„B7') + 0{en) = 

aeS a&S 

= ny ^ aaaBCg + O (era) . (54) 



aes 



In analogy we also get for n > n, 



E{N) = n y2 aaasCa + O (en) . 



Thus, (47) follows, which shows (14) 



(55) 



D 



4.1.3 Proof of (15) in Theorem 2.1 



Proof. First note that (40) in Lemma 



4.1 



The purpose is to use the variance formula 



implies that Var(A^) = Oin' 



Var(r) = E('var(r|^)') + Var('E(r|^)V 



(56) 



where y is a random variable and ^ is a sub-cr-field, see e.g.|10t exercise 
10.17-2]. We consider the subtrees Tj, 1 < i < h^ at depth D = clnn, 
choosing the constant c small enough so that the number of nodes Zjj between 
depth D and the root is 0[n'^) for some arbitrary small e. Let rii be the 
number of balls and Ni the number of nodes in Tj. Conditioned on Qd, 
Ni, 1 < i < b^, are independent and it follows that. 



ftD b^ feO 

Var(iV|f]^) = Var(5^iV, + ZdI^d) = ^ Var(iV,|fi^) = 5^0(n^ 
j=i j=i 1=1 



(57) 



Taking expectation in (57) gives 



6° 



E Var(iV|l]^) =5^0(E(n^)). 



(5^ 



i=l 



Recall that Qd is the a-field generated by {n„, d{v) < D}. 
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Lemma 4.5. For D = clnn there is a 6 > 0, such that 



(59) 



Proof. The representation of subtree sizes in split trees described in ([T]) in 
Section [L4] gives in particular that conditioning on '^u, n^ for i at depth D 
is bounded from above (in stochastic sense) by 



D 



Bm{n,Y[Wj) + siD, 



(60) 



where Wj,j G {1, . . . , D}, are i.i.d. random variables distributed as V. The 
fact that the second moment of a Bin(m,p) is m^p'^ + mp — mp^ and the 



bound of rii in (60) give 



D D 

E{n,'\wn) < ^' n ^/ + ^^^^ n ^^■) + ^(^')- 

Note that E{Wf) < E{Wj) = I, since Wj e (0, 1). Hence, there is an e > 
such that 



D 



E{n,^)<n^llE{W/) + 0(^^)+0{D^^ 



nD 



< 



i=i 



n 



iP 



\D 



+ 



nD 



0{D' 



(61) 



and thus there is a 5 > such that 



i=l 



which shows (59). 



Thus, (58) and (59) in Lemma 4.5 give 



E{Yar{N\^D)) = 0{n^-^) 



By applying (14) in Theorem 2.1 gives 

6^ 



E{N\Qd) = Y. {am + o{nS) + £(^1,1^^). 



n 



(62) 



(63) 



22 



Applying (63) gives 

Var('E(iV|fiB)) = Var (an + o{n)) = o{n^ 



(64) 



Thus, by applying the variance formula in (56) we get from (62) and (64) 
that Var(A^) = o{n^). D 



Remark 4.1. The proof shows that if we can improve the result in (14) in 



Theorem 2.1 such that E(A^) = an + 0{n^ '^^) for some constant ci > 0, we 



will also get a sharper result for the variance, i.e., Var(A^) = o(r2^ ^^) for 
some constant C2 > 0. 

4.1.4 Proofs of the Lemmas of Theorem 12.11 



Proof of Lemma 4^.1. (Note that if Sq > it is always true that N < n and 
if Si > we always have A^ < 2n.) For sq = Si = we can argue as follows: 
When a new ball is added to the tree the expected number of additional 
nodes is bounded by the expected number of nodes one gets from a splitting 
node. Let Z be the number of nodes that one gets when a node of s + 1 balls 
splits. Then 



E(Z) = J2kP{Z = k). 



(65) 



fc=i 



Note that once a node gives balls to at least 2 children the splitting process 
ends. Thus, 



P Z = k 



V, d{v)=k j=l 



+1 



Hence, (65) implies 



E{Z) = J2ko({bE{V'+^))'' 



k=l 



There is a 5 > such that 6E(y^+^) < b ^ since 

E(r^+i) < E{V) = ^, 
for V G (0, 1). Thus, (66) gives 

CO , _x 



k=l 



(1-6-5)2 



(66) 



0(1). 



(67) 



(6^ 
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This shows (39) 



Now we show (40). Note that (40) obviously holds if Si > or sq > 0, 



since then N < 2n. Recall that Z is the number of nodes that one gets when 
a node of s + 1 balls splits. Then by the well-known Minkowski's inequality 



E{N^) < n^E{Z^). 



(69) 



By similar calculations as in (66)-(68) we get that for some constant S > 

oo oo 

E(Z2) <J2k^P{Z = k) = J2k^<^(b'''^) = 0{1). 



(70) 



k=l 



k=l 



Thus, (40) follows from (69) and (70) 



n 



Proof of Lemma 4^.1. The proof of this lemma is in analogy with the proof of 



(39 ) in Lemma 4.1 Adding one ball to the tree will only increase the vertices 



if it is added to a leaf with s balls. Recall that Z is the number of nodes that 



one gets when a node of s + 1 balls splits. Hence, (66) gives E(Z) = £>(!), 
implying that K balls can only create 0[^K^ additional nodes. D 



Proof of Lemma 14- 3[ By applying (42) we get that with probability at least 

1 



1 1 



|n^-M"| <B 



0.6 



We have 



(71) 



(72) 



where 



r 

El = E(^n,J{|n, - M^\ > E^-'^j/K < 2M,"}), 

r 

E2 = E(^J2nrI{\nr - M;| > B'>'^}I{nr > 2M;}). 



Hence, the facts that ^^ M" = (9(n) and that the bound in (71 ) holds with 
probability 1 — -^, give 

El < e(5^ 2M;/{|n, - M;| > i?°-6}) = o(-^ 
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Recall that R is the set of vertices such that r G -R, if r is the root of a 
Tr^B subtree. We obviously have 

E2 < e(^2K - M;)JK > 2M;}/{t; G i?}). 

V 

By summing over vertices v at depth k we get 

00 
E2<Y1 2&''E (k - K)I{nv > 2M;}) P (t; G i?) . (73) 

fc=0 



We write F for the expected value in (73), i.e 



F := E((n, - M;)/K > 2M;} 



Hence, the conditional Cauchy-Schwarz and the conditional Markov inequal- 
ities give 



F: <E| A/E((n, -M," 



%)Jp(n,>2M-\%^ 



< min E 



E{{n,-M^ 



' ;■^ 



V 



M!. 



El a/E( in.-M:; 



(74) 



Prom ([7]) we have that for all v with (i(f) = d, conditioned on '^^ (i.e., the 
cr-field generated by Wj^^,j G {1, . . . , d}), riy < n'^ + n", where 



n. 



Bm{n,l[W,, 



d 



n. 



I ■= Bin(si, J] ly,-,) + ■ ■ ■ + Bin(si, W^,,) + Si. 
i=2 
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Thus, (|74|) gives for M^^ > 1 
/ 



F<E 



<E 



<E 



E( (n„-M," 



2 x^ 



M" 



e((< - M,")'|^,) e(«)2 + 2« - 2<M; 



^„ 



M.'; 



E( (<-M," 



\ 



M: 



Ml: 



+ E «)^ +2E < 



(75) 



where we in the last equahty apply that E I nM = M". For M" < 1 we apply 



that (74) gives 



F<E A E n„-M 



^. 



(76) 



By applying the fact that the variance of a Bin(?7i,p) distribution is m{p—p^ 



we get E( [n'^-M^ 



< M^, and from the Minkowski's inequality we 

easily deduce that E( (n")^ j = 0{1). Hence, by using that we can bound F 
as in (75) for M" > 1, and by the bound in (76) for Af" < 1, we get that 



F = C(l). Thus, from ([73]) we get 

oo 

E2<5]]^''C'(l)P(^ei?). 



(77) 



fc=0 



Note that v & R only if M", s > B. Hence, by applying Corollary 



p{v) 



K = B, we get from (77) that E2 = C^(^)- By applying Lemma 
combination with ( 72 ) and using the bounds of Ei and E2 we get 



3.1 



for 
in 



E(J]Ar,/{|n,-M;| >5°-6}) =c(E(^n,,/{|n,, -M;| > 5°-^}) 



O 



n 
B^ 



{n 



n 
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Proof of Lemma J^_J^_. Recall the definition of Yfc = — X]i=i ^^ ^i ^'^'^ ^(^) ~ 
6P(- In Wj < t). Also recall that we write S = {1, 1 - 7, 1 - 27, . . . , e} for 
7 = e^. We have for a G 5* 

00 
H\RaB\) = Y,b'+'(p(^{Y,-\nW,^, > ln| -lna}f|{n < ln|}) 

fc=0 

-P({n-lnW^,+i>ln|-ln(a-7)}nm<ln|}))- 

We write q := In^. From the definition of U{t) we have that E{\RaB\) is 
equal to 



Z{q):= I b(p(-lnWk+i>q-t-lna 

- P( - \nWk+, >q-t -In (a- 7)))c^t/(t). 



Hence, 



Z{q):= I bP(q-t-lna<~\nWk+i<q-t-ln(a--f))dU{t). (79) 



We write 



G{t) ■.= bp(t-\na < -\nWk+i < t - In Ta - 7 



Thus, 



Z{q) = {G*dU){q). 
Recall that we write duj{t) = e~^dv{t) where u{t) is a probability measure. 



Recall from (25) that we have 

U{t) = u{t) + {U*duj){t), 

where U{t) := e^^U{t) and z/(t) := e~*z^(t). Thus, by using [H Theorem 
VI.5.1] we have for Z[x) = e~^Z(x) and G{x) = e~^G{x) that 

Z{q) = {G*dio){q). 



By using (79) this implies that 



Z{q)= / be^-'^P(q-t-\na<-\nWk+i<q-t~\n(a--f]]dco{t). 

(80) 
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By using the key renewal theorem [9l Theorem II. 4. 3] applied to U{t) we 
get 

\imZ{q) = - e~*p(t-\na <-\nWk+i<t -In (a --r))dt. (81) 

g-s>oo /i Jq V V / / 

Note that limg_>oo Z{q) := c^, for some constant Cq only depending on a. 
Thus, by using Z{x) = e~^Z{x) we get that 

E(|i?«B|) = |c„ + o(|), 



for the constant Ca (only depending on a), which shows (46). 
Also note that we have 



poo / \ \ 

^c„ = -J^/ e~*^P(t-lna < -InWfe+i < t-ln(a-7Jjc/t (82) 

r°° / \ b 

/ e-*P(t < -Iniy^+i <t-ln(e-7))dt < -. (83) 

Jo ^ ^ A* 



ae5 ^ -^0 Q6S 



6 '•°° 



D 



4.2 Proof of Theorem 2.2 



Proof. We use large deviations to show this theorem (in fact we get a sharper 
bound of the number of bad nodes) . Note that a vertex v belongs to the tree 
if and only if n^, > 1. Recall that there is an upper bound of n„ with d{v) = d 
in ([T]) above, i.e., conditioning on ^^ in stochastic sense. 



n, 



d d 

< Bin(n, JJ W,,,) + Bin(si, JJ W,^,) 
i=i i=2 

d 

+ Bin(si, JJ ly,,,) + ■ ■ ■ + Bin(si, 1^^,,) + Si, (84) 

i=3 

where W^j>, j G {1, . . . ,(i}, are i.i.d. random variables distributed as V. It 



is enough to just consider the first term Bin(n, 117=1 ^j,v) i^i (84), and prove 
that the number of bad nodes with Bin(n, ni=i ^j,v) > 1 is bounded by 
Oil ( j^fc+i ^ j , where we choose k large enough. If si = 0, Bin(n, 117=1 ^j>) 
is the only term in (84). We now explain the fact that we can ignore the 



terms in n^ that occurs because of the parameter si. Assume that for split 
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trees with si = 0, the number of bad nodes is bounded by O^i ( — ^^t 

We first consider the vertices with d < /i^^lnn — In ' '^^ n. If si > 0, we 
assume that we first add the n balls as in the construction of a split tree 
with the parameter si = 0. Hence, the number of vertices v with d < 

/i~^ Inn — In"'^"*"" n, is bounded by O^i i Jl^ ) . We now repay the subtree 
sizes for their potential loss of balls because of Si > 0. A vertex v at depth 
d can at most have a loss of sid balls in the subtree rooted at v. These balls 
cannot give more than Sibd nodes to the tree (since only if sq = Si = it is 
possible for an increment of more than b nodes when a new ball is added to 
the tree). Thus, since d < ^~^ Inn and the fact that we assume that we have 

C^Li ( fc+i ) nodes before the repayment of the loss of balls, these additional 

balls cannot give more than Oli i —^ j nodes. Now we consider the vertices 

with d > fi^^ Inn + In'^'^''"'^ n. Again we first distribute the n balls assuming 
that Si = 0, and then repay for the potential loss of balls in the subtrees if 
si > 0. First note that for d = 0{lnn) we can argue as in the previous case. 
This means that the number of nodes with fi~^ Inn + In ' '^'^ n < d < Klnn 

for some arbitrary constant K is bounded by O^i i p^ j . For larger d we 

argue as follows: For any constant Ki > 0, 

d d 

mBin(si, J]^ Wj^^) + mBin(si, J]^ Wj^^) H h mBin(si, Wd,v) + Si 

J=2 i=3 

d d 

< mBin(si,]^ W^j^j,) H h mBin(si, ^ Wj^^) + KiSilnn. 

i=2 j=d- 



K\ In n\ 



The Markov inequality gives, 

d 



P(mBin(si, JJ W^,,,) + ■ ■ ■ + mBin(si, JJ 1^,>) > l) 

3=2 j=d- XilnnJ 

d d 

<E(Bin(si,J]W^,-,) + --- + Bin(si, J] W,,,)') =o(b 



i=2 i=d- 



Kilnn\ 

(85) 



where the last equality is obtained by first condition on ^^ and then take 
the expected value twice. Thus, the expected number of vertices that gets 

a repayment of at least Ki-Silnn + 2 balls is bounded by Oi ^K^inn )■ Since 

si > 0, we can assume that d < n. Hence, the expected number of balls of 
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this contribution is O i ^^"inn ) ; choosing Ki large enough this number is just 

0(1) and can thus be ignored. 

It remains to prove that if Si = the number of vertices v, where d{v) < 
fi~^ In ra — In ' "^"^ n or d{v) > fi'^ In ra + In ' ^"^ n, with n^, > 1 is bounded by 

Oli i j^fc"i ^ j for any constant k. Note that an upper bound of the expected 

number of vertices at depth d is given by 

b'^P{n^ > 2), (86) 

where f is a vertex at depth d — 1. Note that this is true even in the case 
So = 0, since for all internal nodes n^, > s + 1. Choosing t > 0, an application 
of the Markov inequality implies that 

PK > 2) < PKK - 1) > 2) < 

p«K - 1)' > 2*) < ^^<^''^^-^y\ (87) 

Thus, an upper bound of the expected profile for the vertices at depth d is 

6'^E«(n,-l)*), (88) 

where f is a is a vertex at depth d — 1. 

First we show that the number of vertices v (assuming Si = 0) where 

d{v) > fi~^\nn + ln°'^'''^n is bounded by Oli i ^^^"1 ^ j . We prove this by 

choosing t = — |^, where e{n) > is a decreasing function of n that we 
specify below, and show that 

± 6^E(n.!^K-l)^)=0(^). (89) 

Let Xd be a mixed binomial (n, Y[j=i ^j)^ where Wj, j G 1, . . . ,d are i.i.d 



random variables distributed as V. To show (89) it is enough to show that 
the expected value of 

± 6-E(.Yf^(.Y,-l)'^|%), (90) 

is O I iJli ] . That this is enough follows because of the bound of n^ in ( 84 ) , 
since we assume that si = 0. Suppose that e{n) < 1, thus the Lyapounov 
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inequality (which is a special case of the well-known Holder inequality) gives 



e(x, ^ (X,-l) 



l + £(n) 
2 






<(nnw^,) 



l+€(n) 



(91) 



Hence, to show (89) we deduce from the right hand-side of the second in- 



equality in (91|) that it is enough to show that 



d=[^-ilnn+ln''-5+^nJ-l 



l+^W^I^°'^l+e(n) ^ Q 



n 



ln'=+^ n 



(92) 



Taylor expansion gives 



W 



l+e(n) 



^^.^e{n)lnW, ^ ^, (. ^ ^(^) j^ ^^. ^ ^^^2^^-^ 



+ o(Wj€^{n)\n^wX 



(93) 



Thus, by taking expectations in (|93|) we get 



d=[At-ilnn+lnO-5+=nJ-l 

oo 

E 

d=[At-llnn+lnO-5+=nJ-l 



o 9 

1 - /ie(n) + ^4^e2(^) ^ 0{t\n)) ) n^^^^") 



In 



E 

d=[At-llnn+ln°-5+'nJ-l 

l-^e(n) ln-0-^+' n+O ( e\n) ) 



/ie{n)+^e2(„-)_,_0(^3(„)) j d+lnn(l+£(n)) 



c 



en 



(94) 



Hence, by choosing e(n) = 51n n for some constant 6 (that we choose 



small enough) we get from the last inequality in (94) that for some constant 
-B > and any constant fc. 



S^ = 0[ne-^'^"''^\ =0' "" 



In'^+i n 



(95) 
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let t 



We argue similarly for the vertices v, d{v) < fi ^Inn ~ In ' ^^ n. In (88) 
where e(n) = 5\n~^'^^'^ n for some constant S as above. In 



i-^(") 



analogy with (89) an upper bound for the expected number of vertices v 
with d{v) < /i~Mn?T, — In ' """^ ra is 



[At-ilnn-lnO-5+^nJ 



S,:-- 



l-e(n) 

d=0 



1) 2 ). 



We use similar calculations as in (92)-(95) to show that 

w— n V / 



(96) 



(97) 



This implies in analogy with (89)-(92) that for some constant B and any 
constant fc, 



s^ = o{ 



ne 



-Bln^^ 



^)=o[ 



n 



In'^+i 



n 



(98) 



Hence, if si = the number of bad vertices is bounded by C^i ( — ^^1 
for any constant k. Thus, it follows from our previous explanation that the 



number of bad vertices for arbitrary si > is bounded by Oii 



In'' 71 / ■ 



D 



Remark 4.2. We note from (94), (95) and (98) that we in fact get a sharper 



bound for the number of bad nodes, i.e., Ojne ^ '"^ ""^ for some constant 
B' >Q. 



Remark 4.3. From the calculations in the proof of Theorem 2.2 in particular 



in (94), we see that we can get a much smaller error term for larger depths. 



i.e., for any constant r there is a constant C > so that the number of nodes 
with d{v) > Clnra is bounded by (9i^i(^). 



4.3 Proof of Theorem [273 

Proof. We write 



Dn — fi ^ In n 



v^ 



(99) 



n 



By a classical result in probability theory, see e.g. pUl Theorem 5.5.4], the 
limit law in ^ implies that (16) holds if Z„ is uniformly integrable. In par- 
ticular this is true if Z^ is uniformly integrable. This uniformly integrability 
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also gives 



E{zl) ■■- 



E((D„-;f/-ilnn)') 



Inn 



^E(iV(0,aV')' 



aV'- 



(100) 



Furthermore, the convergence results in (16) and (100) imply (18) for k 



By using the same coupling argument as in (p) it is easy to show that the 
convergence result of the expected depth in (16| implies the convergence 



n. 



result of the expected average depth in (17). 

Thus, it remains to show that Z^ is uniformly integrable and that (18) for 
k = n implies that (18) also holds for j^ < A; < n. By a standard argument, 
see e.g. pUf Theorem 5.5.4], Z^ is uniformly integrable if for some p > 1 and 
Uq large enough. 



supE(|Z^|P) : = supE 

n>no n>no 



{Dn -/i Mnn)' 



Inn 



(101) 



is uniformly bounded. We choose p = \- We show that this is true by using 
similar calculations as Devroye used in [3] for proving the limit law of Dn in 
(Ig]). First, consider an infinite random path Ui,U2, ■ ■ ■ , in the skeleton tree 
Sb, where Ui is the root. Given Ui and the split vector V^ = (Vi, . . . , VJ,) for 
Ui, then Uj+i is the j-th child of i with probability Vj. Construct a random 
split tree with n balls and let u* be the unique leaf in the infinite path. Then 
by using a natural coupling, letting the n:th ball follow the random path, Dn 
is in stochastic sense less than or equal to the distance between u* and the 
root. In the coupling D^ is less than this distance, if the n-th ball is sent 
to a leaf which splits and does not send this ball to one of its children (i.e, 
the n-th ball is one of the sq balls). If the n-th ball is one of the si balls it 
is added to a child of p{u*) (the parent of u*), i.e., it ends up at the same 
depth as u*. Recall that Hn denotes the height of a split tree with n balls. 
For all /3 > we have 



p(d„ > A; + /3) < P(n(«fc) > p) +P(^Hp > /?), 



(102) 



and 



p(D„,<k)<p(n{uk)<s + l]. (103) 

Recall that A = Vs, where given (Vi,...,Vb), S = j with probability Vj. 
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Then n{uk) is stochastically bounded by 

k k 

n{uk) < mBin(n, 1 I '^j) + niBin(si, 1 I ^j) 

k 

+ mBin(si, JJ A^) + ■ ■ ■ + mBin(si, A^) + si, (104) 

i=3 

where A^ are i.i.d random variables distributed as A. 

Consider the probability P(-D„ > k + (3), where k = [/x~^ Inn + | Vln n\ 
for X G R~^. We bound this by bounding the probabilities in the right hand- 



side of (102), choosing /3 = [f In ' (n)J . First note that the bound of n{uk) 



in ( 104 ) implies that in stochastic sense 

k k 

^(wjfc) < mBin(?T,, ] [ Aj) + mBin(si, 1 I A^) + . . . 

J=l 3=1 

k 

+ mBin(si, II A,) + [^ln'-'n\. (105) 



Thus, we can bound the first probability in the right hand-side of (102) by 
P{n{uk) > /3) < P ( mBin(n, JJ A^) + [— ln°-^ n\>/3-l 

(k k \ 

mBin(si, JJAj) + --- + mBin(si, JJ Aj)>lJ. (106) 

For bounding the first probability in the right hand-side of the inequality in 



(106), we use [Sj Lemma 4] which states a general result for bounding tail 
probabilities for mixed binomial {m, Z) distributions where Z is a random 
variable, thus we obtain 



Pi := P fmBm{n, f[ Aj) > (3 - [— ln°-i n\ - l\ 

V j=i / 
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From (107) we deduce that for n large enough 



Pi < P 



^^ .-^ . >- Inf ^-L^r-^'M +fe/i 



E,=i In Aj + /c/i 



Ta?^ 



> 



2n 



Ta?^ 



^ p,'E^=^]nVt^ ^ x^ >| _^ /e 



\fk 



a^ 



3ct 



+ 



/3-L^ln"-^nJ-l 



(108) 



Recall the notations c := E (A) , /i =: E ( — In A) and a^ =: Var ( In A) . Note 
that c < 1. Since the Aj, j G {1, . . . , fc}, are i.i.d random variables we can 
use the Marcinkiewicz-Zygmund inequalities, see e.g. jTUl Corollary 3.8.2], 
which gives for q > 2, 



e( ^InAj + fc/i j < BykiEl lnAj+/i J 



(109) 



where Bg is a constant only depending on q. By using the Markov inequality 



and ( 109 ) we get from ( 108 ) that for n large enough 



E 



Pi< 



ELilnAj + fc/i 



v^ 



a" 



3N 4 

l5^ 



S.E 



< 



In Aj + /i 



X/i2 



,3_Lil±ln'J-tnJ-l 



c 



x^ 



,3_Lil±lnU-tnJ-l 



for the constant C = "^ e — < oo (recall from section 

moments of | In A| are bounded). The Markov inequality implies that 

(k k \ 

mBin(si,JjAj) +--- + mBin(si, JJ ^i) > 1 ) 

(k k \ 

mBin(si, JJ Aj) + • • • + mBin(si, JJ A^) j 

= C('cLi'>^°'"jy for c := E(A) < 1. 



(110) 



that all 



j=A;-[|ln''-lnJ+l 



:iiii 
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We now consider the other probabihty i.e., P[Hj3 > /3j. (Note that this 

(112) 



probabihty is if Sq > or si > 0.) By applying (86) we get 

p(Hp> p) <b^p(n{v) >2 



where w is a vertex at depth 13 — 1. From (87) we deduce for t = 0.75, 

P(n. >2)<E(n°-^5(n,-lf^5). 



:il3) 



Let Xp be a mixed binomial (n, Y[i=i ^j)^ where Wj, j G {1, . . . , /3} are i.i.d. 



random variables distributed as V. Note similarly as in (90) that (113) is 
bounded by the expectation of 



e(x°-^5(x^-i)o-^5|^^ 



(114) 



We note similarly as in (91) that the Lyapounov inequality gives 



/3 /3 ^g 

e(x°-^^(X^ - lf''\^p) < {P'-Pf^'Hw}-' < {^U^j) ■ . (115) 

Again the fact that E{Wf) < E{Wj) = I (since Wj G (0, 1)), gives that there 
is a 5 > such that 



p(hp >/3j <b-^^P^-^. 



(116) 



We now consider the probability P(-D„ < /c), where k = [fi ^ Inn — xa/Iu n\ 
for X G R~^, and use the bound of the larger probability in (103). We have 



Pin{uk) < s + 1) < P( -ks + Bm{n,Yl^j) < s + 1 J. (117) 



Again by applying [3l, Lemma 4] and using similar calculations as in (107)- 



(110), we get for n large enough 



Pi<V 



^Jka^ 



< 



\fko^ 



+ 



2\ s(fe+i)+i 



^pj T-=ilnA, + A;/i ^ ^^ ^ /2x-('=+i)+i 



BaE 



In Aj + /i 



3a 



< 



X/iS 



'2\s{k+i)+i 1 

+ I-) =c 



2ss{k + l) + l 

4 ■ vt) . (118) 
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for the constant C = '^ & — < oo. Now we can show that for Uq 



large enough sup^^^^^ E(|Z^|2) in (101 ) is uniformly bounded: By the choice 
of k and /3, we get from (110), (111), (116) and (118) that for for uq large 
enough 



supE(|Z^|i) : = supE 

n>no n>no 



{Dn - fi ^Inn) 



2 3 

2 



Inn 



sup / 3a; P 

n>nQ Jx=0 



Dn — yU ^ In n 



VE 



n 



< sup 



n>no [^ Jx=l \ ^ 



6C 



> X jdx 



+ 3x' 



+ 3x' 



2\s{k+l)+l 



o(^xh^^2^''"'''\^ + 3x%~^^ f3'Adx\ + 1< oo, (119) 



and thus Z^ is uniformly integrable so that (100) holds, which shows (18) for 
k = n. 



From this result it is now easy to show as we explain below that (18) also 
holds for all k, ^ < k < n. Recall that we denote the depth of ball k, when 
it is added to the tree by Dl. As we argued for proving (jSj), in stochastic 
sense for k < n. 



Di<Dk< Dr. 



(120) 



From (161) it follows that for all j^ < k < n, 



DI — ji ^ In n 
a'^jj,^^ Inn 



4a^(o,i) 



By using this and (120), for p- < k < n 



Dh — a ^\n.n d 

/,,, 4iv(o,i). 



We need to show that for t^ < k < n, 

Inn — — ' 



E((Dfe-/i-ilnn)' 
Inn 



—?■ 



E(iV(0,aV^)' 



As for Dn this follows if for no large enough, 

(^Dk — 1^^^ Inn) 



sup E 

n>no 



2 3 
2 



Inn 



< oo. 



(121) 



(122) 
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We have for k < n, 



V\^ 



n 



> X 1 < P 
<P 



Dk — /i ^ In n 

Vlnn 
fi^^ Inn 



Dr, 



and 



Dn — fJ' ^ In n 



Vh^ 



n 



<a; 1 < P 
<P 



\/\nn 
Dk — f^~^ Inn 



> X 

> X 



Vh^ 



n 



Dl — fx ^ In n 



VE 



< X 



< X 



n 



Thus, ( 122 ) fohows from the calculations in ( 119 ). This shows that ( 18 ) holds 
for all k, p- < k < n, follows from the fact that (18) holds for k = n. D 



We now prove the two corollaries of Theorem 2.3 



Proof of Corollary 2.1. We show (20) and from this it is obvious that (19) 



also holds since Theorem 2.2| implies that the bad vertices are few enough so 
that we could equally sum over all vertices. 



First note that (121) gives that for the balls [j^J < k < n, 



E{Dk-lJ, Mnn)^ = /i ^a^lnn + o( Inn). 
Recall that a vertex f in a split tree T is called good if 
/i"^ In n - ln°-^+' n < d{v) < fi'^ In n + ln°-^+' 



(123) 



n, 



and that we write V*(T") for the set of good vertices in T" and A^ 



2.2 



|V*(T"')| for the number of good vertices. Note that (14) and Theorem 
implies that 



E{N*) = an + o{n). (124) 

We will now consider subtrees defined similarly as the Tr^B, r E R, sub- 



trees we used in the proof of Theorem |2.1[ However, instead of using the 
product M^ for defining the stopping time in each branch we use the real 
subtree size n^: Let U be the set of vertices such that m G f/, if and only if 
TT-u < In'^'^ra and np(^u) > ln°'^n (where p{u) is the parent of u), and consider 

1 4 

all subtrees T.^ ' ", u E U, rooted at u. 
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It is an immediate consequence of Lemma 4J^ that the first and second 
moment of the height of a subtree with ln°'^ n balls is bounded by O i ln°'^ n j 

and Oi In^*^ n j , respectively. However, there are much stronger bounds, e.g., 

[3] since split trees are of logarithmic order. Hence, since the subtrees are 
small by applying that d{v) — fi~^ Inn = {d{u) — fj,^^ Inn) + {d{v) —d{u)) and 
summing over all good vertices we get 






ofnlnn). 



(125) 



In (125) we use the bound for the good vertices, but it is obvious from 



Theorem 2.2 that the bad vertices are few enough so that one could equally 



sum over all vertices. The number of subtrees that hold the balls k < I r^ I 

L In n -I 

is trivially bounded by [j^J . Thus, the number of nodes in these subtrees 
is bounded by C'^i (^j^V Let A^* = |V*(ry'")| be the number of good 

1 4 1 4 

vertices in T^'^ ' ". Hence, by applying that the subtrees T^° ' ^, u G U, 
are small so that {d{v) — /x^^lnn)^ do not differ more than (9( In ' n) for 

1 0-4 I I 

different vertices v E T^ ' ", together with (124) we get 



E^^ ^ (rf(u) -/i"Mnn)V« 



v<^y 



(rh.o.4„) 



A;er„ 



n„ 



E((A-/i-Mnn)2K) 
2^ 2^ {ariu + o^riu)) + Oii(nlnn) 



(126) 



u keTu 



Recall from (45) in Lemma 4.3 that 



E(iV) = e(^ iV,/{|n, - M;| < 5°-^) +0^^^ 



reR 



where M^ by definition is less than B. If we choose B := In^'^n this means 
that for the expected value in the right hand-side we can assume that n^ < 
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In ■ n. Hence, the expected number of (good) vertices in T"' that are not in 

the subtrees T^'^", u eU, is bounded by o(^j for B = ln°-^ n. Hence, 

this bound imphes that the expected value of the last equality in ( 126 ) is 
equal to 

Y^ E({Dk - /i^Mnn)^') {a + o(l)) + o(nlnn) 
= ifx^^a^lnn + o(lnn) j {an + o{n)) + o(nlnn). 

n 



Proof of Corollary \2. 2 . As in Corollary 2.1 we only show the result for the 
good vertices, i.e., p2). From the proof it is obvious that also (21) holds by 



applying Theorem |2.2[ showing that the number of bad vertices is covered 
by the error term. We observe the obvious fact that the sum of those rii, i & 
{1, . . . ,b^}, which are less than rfr for large enough k, is bounded by 






(127) 



(Note that by choosing k large enough in (127), the power of the logarithm 
can be arbitrarily large.) 

Recall that V* (Tj) is the set of good vertices in Tj and that ^2^;, is the 
cr-field generated by {n^, d{v) < L}. Let 

{di{v) -/i"Mnni)2 



E 

I'GV* (Ti) 



/^" 



-3 1n3 



Ui 



Thus, from (20) it follows that 



EE Z: 



a aui 



E 



o{ni) 



2„ +Z^1„2 



In n. 



In rij 



Let A; > be a fixed constant and assume that ni is at least p^; by Taylor 
expansion we get 

1 



In n, 



1 ,^ / In In n 

V In'^n 



In n 
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By applying (127) for k large enough to cover those rij that are less than -^ 
in an error term o(j^^), and using (128) we deduce 



E 

4 = 1 



In Ui 




+ o — ^— = o — ^ — . 



(129) 
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Hence, since we can assume that rii is at least -^ for large enough /c, by 
Taylor expansion 



^^2 2 

a arij a an 



bL 



i=l 



In m In n 



n,lnlnr2\ a'^an 



EO Uiliq O ULli ^^ I V ^ f Lq 11± 111 ft \ KJ \JLfL / I L \ , , 



i=l 



\n^^n 



n 



In n In n' 



Since only the good vertices are considered, and the random variables 
conditioned on Vt^ are independent for i G {1, . . . , fe'^}. 






i=l 



i=l 



i=l 



ln"-2^n, 



Vt, 



(131) 



Thus, the well-known Minkowski's inequality and the fact that E(A^^) 
0{v?) imply 



b^ / b^ 



nr 



ln^-"^n, 



(132) 



Similarly as in (59) for /3 large enough. 



b^ 



1=1 



in n 



Applying (133), Chebyshev's inequality gives (22). 



(133) 



n 



5 Results on the Total Path Lengths 

We complete this study with some results and a conjecture of the "total path 



length" random variables. Recall from Section |1.2| the definitions of the two 
types of total path length \1'(T) and T(T), i.e., the sum of the depths of balls 
and the sum of the depths of nodes, respectively. 



From (17) we have 



E('^(T")) =/i"^nlnra + ng(n). 



(134) 



where q(n) = o(ln ' n) is a function that depends on the type of split tree. 
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Similarly, by using(14) in Theorem 2.1 and the profile result in Theorem 



2.2 including Remark 4.3 (which gives a smaller bound of the expected num- 



ber of vertices with depths much bigger than the depths of the good vertices), 
we get 



E 



fT(T"')j = fx ^anlnn + nr{n), 



(135) 



where a is the constant that occurs in (14) and r{n) = o{\nn) is a function 
that depends on the type of split tree. 



(A2). Assume that the functions q{n) in (134) converges to some constant 

In [17] there is an analogous assumption. Examples of split trees where it 
is shown that q{n) converges to a constant are binary search trees (e.g. [8]), 
random m-ary search trees [T3] , quad trees [TTj and the random median of a 
{2k + l)-tree [T^, tries and Patricia tries P]. 



(A3). We assume that the result in (14) in Theorem 2.1 can he improved to 

E(A^) = an + f{n), 



where f{n) = o(^:^^^). 



Stronger second order terms of the size have previously been shown to 
hold e.g., for m-aiy search trees [121, for these f{n) in assumption (A3) is 

o{y/n) when m < 26 and is 0{n^~^ ) when m > 27. Further, as described in 



Section |1.3 tries are special cases of split trees which are not as random as 



other types of split trees. Flajolet and Vallee (personal communication) have 
recently shown that also for most tries (as long as — In V is not too close to 
being lattice) assumption (A3) holds. 

Theorem 5.1. Assume that (A1)-(A3) hold, then also r{n) converges to 
some constant (. 



Let 



and note that 



r„ := anqiji) — nr{n), 



aE('*(T")) - e('t(T")) = r. 



(136) 



(137) 
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For proving Theorem 5.1 we will show that — converges to a constant. We 



write X] D fo^ ^ s^^^ where we sum over all vertices v G T'^ except the root 
i.e., V ^ a. First we recall that the total pathlength for the balls is equivalent 
to the sum of all subtree sizes (except for the the whole tree) for the balls 
i.e., 



vl/(T") = Y, n., 



:i38) 



where a is the root of T". Similarly we recall that the total pathlength for 
the nodes is equivalent to the sum of all subtree sizes (except for the the 
whole tree) for the nodes i.e., 



T(T'^) = J2^^ 



(139) 



where a is the root of T^\ Hence, by assuming (A3) we get from (137) that 



ai 



E 



ME 



An'+'ur 



* / 

E< 



n. 



(140) 



Vlog^+'n„. 
We will again consider the TrB, ^ & R, subtrees from the proof of Theorem 



2.1 



in Section Isl (defined such that M^ := nHf^lWj < B and M^^^^.^ : 
n ]^ 4 1 ^j ^ B). However, here we choose B differently, i.e., B = e^"^^ 

Lemma 5.1. Assume that (A1)-(A3) hold, then 

aE('^(r")) - E('T(r")) = r„ = 0{n). 



Furthermore, 



r„ = ^E(r„„)+o(n), 



(141) 



(142) 



re-R 



where 



r„, = aE{^!{Tr,B)\nr)-'E.{i:{T,^B)\nr). 



(143) 
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Proof. Assuming (A3), we get from (140) that 



riv 



Eo(^-^)-o("). 



(144) 



where we apphed Corollary 3.1 in the last equality. In the same way, for the 
vertices which are not in the Tr^Bi r & R, subtrees (ignoring the root a) we 
deduce that 



E 



( ^ -id^ 






log ^' n^ 






n 



2k J.l+e 



\ = o{n). (145) 



Hence, (142) follows from (145) and (144) 



D 



Proof of Theorem 5. 1 We will use the same type of proof as the proof of 



(14) in Theorem 2.1 We start with two arbitrary values of the cardinality n 



and n, where n > n, and show that 



-L n 

n 



^ n 

n 



0(t) as n ^- oo, Ve > 0. 



(146) 



Since (146) implies that -^ is Cauchy it also converges to some constant as 



n tends to infinity; hence, we deduce Theorem 5.1 Recall from the proof of 



Theorem |2.1| that a main application for the proof is to use (39) in Lemma 
4.1, Here we use an analogous applications of (144) in Lemma 5.1, i.e.. 



Tn = 0{n). 



Recall that we prove Lemma 4^ by showing 



(147) 



and then applying (39) in Lemma 4.1 In the same way by using (147) and 
(141) as well as (|142[) in Lemma 5.1 we get that 



r„ = E(5^r„,/{K-M;|<5°-n)+o(n) + o(-^). (148) 



reR 
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Recall that R' (1 R is the set of vertices such that r G -R' if 

|nr-M,"|<5°■^ (149) 

and that R" C R' is the set of vertices such that r G i?" if r G R' and 

eB < M^ < B. (150) 



Lemma 43 shows that we only need to consider the vertices in r G R' . 
Similarly as in (50) we get 



r„ = E(5:r„,)+oH+o(^), 



(151) 



refi" 



Recall from the proof of Theorem 2.1 that we sub-divide the T^.^^, r E R, 
subtrees into smaller classes, wherein the M^, r G -R, in each class are close 
to each-other. As before we choose 7 = e^, and let Z = {B, B — ■yB, B — 
2jB, . . . , eB}, where e = | for some positive integer k. Recall that we write 
Rz ^ R, z E Z, for the set of vertices r E R, such that M" E [z — yB, z) 
and M^(^) > B. Hence, (|l5l| gives 



r" = E(E E r..)+0(en)+o(^). (152) 



z<^Z reR'nR^ 



To approximate the expected value in ( 152 ) we apply a lemma that is similar 
to Lemma [4.21 

Lemma 5.2. Adding K halls to a tree can at most have an influence on 
Ef^(r")) an(iEfT(T'^)), respectively, by 0{Kln{n + K)). 



Proof. Adding one ball to a tree with n balls the expected depth is 0{lnn) 
and the expected number of additional nodes is C(l). Note that 0{1) nodes 
only have distances 0{1) between each other.) Hence, when the K-th ball is 
added the expected depth is C(ln(n + i^)). Since K balls give an expectation 

of 0{K) nodes the result holds for both E h{T")) and E f T(r")) . D 



Let Tz be an arbitrarily chosen node in R' (1 Rz, where z E Z. Similarly 
as in ( [52| ), by using (149) and Lemma [5. 2| from ( 152[ ) we get 

Tr, = J2n\R'r^Rz\){Tn.^+0{yB\nB))+0{en)+0[-^). (153) 



Define b^ in a tree with cardinality [x\ as 62 



5.1 



"TT^, and note from Lemma 
that b^ = C(l). Recall that 5 = {1, 1 - 7, 1 - 27, . . . , e}, where 7 = e^. 
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Recall from (53) that for each choice of 7 and a G 5, there is a a^ such that 
for a constant Cq, (depending on a), 



E(|i?'ni?„B|) 



n_ 
B 



- Cn 






(154) 



whenever ^ > — . By choosing B = e ^^ for n large enough ^ > — so that 



(154) holds. Moreover, since ^^es '^^ = ^(1) ^^ have that ^^es Ca X*^ ~ 
0(7 In e). Re call t hat 7 = e^. Thus, for a constant c^ (depending on a) and 
6„B = 0{l), (|l53|) and (|l54|) imply that 



Tn = nJ2 Ca^{baBaB + 0(57 In 5)) + ri J] 0(6,^72) + 0{en) 

aeS aeS 



n 



2ja&aBCa + 0[en). 



aeS 



In analogy, also for n > n, 



aeS 



Thus, (146) follows, which shows Theorem 5.1 



(155) 

(156) 

D 



Finally we present a theorem that is applied in |12j . 



Theorem 5.2. Let L = [/^log^lnnj for some large enough constant /3. As- 
sume that (A1)-(A3) hold, then 



2^ ,,-21^2 



nq 



l^'~ 



Hi 



ST^ -I- -\- 

^ a-^ In Hi u-'^\v?n ^Vln^n 
1=1 ' 



(157) 



and 



i=l 



T(T,) 
/i~21n^ nj 



b^ 



77^ 



an,- 



+ 



nC 



+ 0. 



^ /i Unnj fj,-'^\n'^n ^\hi^n 



n 



(158) 



ments for showing (157). 



First, (135) gives 



Proof. We only show (158), since we can use exactly the same type of argu- 






fi, 



nir[ni) 



2=1 J=l ' 



(159) 
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Note that conditioned on Ql, the summands T(Tj), i G {1, . . . ,b^} are 
independent. By applying the Cauchy-Schwarz inequahty, and using the 
facts that E{N'^) = 0{n'^) and that E{DI) = C(ln^n) for all k, we deduce 
that 



Var(^T(r,)|fiJ =5^Var(T(T, 



n, 



(160) 



Similarly as in (133), for any constant k (and choosing the constant /3 in L 
large enough) the following holds 



fe^ 



E E 



■n.. 



«=i 



n 



In'^n 



(161) 



Thus, for a large enough constant /3, by taking expectations in (160) we get 

(162) 



-..(E;^h)^»(^ 



Using (159) and (160) and applying (162), the Chebyshev inequality results 



in that conditioning on D.^, 



E 



T(T,; 



-^ fi~Hia^ rii -^/i-Unrij ' -^/x-^ln^n. 



//-linn. .^— ^ 



+ 0^ 



n 



P\ 1 2 

m n 



(163) 



By applying Theorem 5.1, (128) and (129) we get 



E 

i=l 



/i 



nir[ni) 
Hn^ rii 



2^ ,,-21v.2^. "^Z^ 



i-21n2' 



+ 



In^ iii 



C^i sr^ o{ni) 

1=1 '^ ' 4 = 1 

In n 



n 



In n 



Thus, (158) follows from (163) and (164). 



(164) 
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